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Consider  a  signal  x(t)  =  f(t)  +  w(t),  0  £  t  <  1.  Here  the  noise  w(t)  is 
an  independent  process,  and  f(t)  is  a  function  with  only  finitely  many 
jumps,  satisfies  a  Lipschitz’  condition  between  any  two  consecutive  jumps. 
This  paper  gives  an  algorithm  to  determine  the  number  ,  locations  and 
magnitudes  of  the  jumps  of  f(t).  The  consistency  and  speeds  of  convergence 
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DETECTION  OF  THE  NUMBER,  LOCATIONS 
AND  MAGNITUDES  OF  JUMPS 

Y.  Q.  Yin 


§1.  Introduction.  Jump  detection  or  change-point  detection  is  a  very  im¬ 
portant  problem  in  statistics  and  engineering.  The  problem  can  be  stated  in  the 
following  way: 

Let  x (t)  —  f{t)  -r  w(t)  be  a  stochastic  process,  0  <  t  <  1.  Here  u>(l)  is  the 
noise  process,  Ew[t)  =  0,  and  f(t)  —  Ex(t)  is  a  function  with  only  finitely  many 
discontinuities  <!,•••  tq.  Suppose  these  discontinuities  are  all  interior  points  of 
[0,  l)  and  for  each  i  =  1,  •  •  •  q,  /(<,  -fO),  f(ti  -  0)  exist  and  /(<,•  -f  0)  ^  f[tl  —  0). 
For  definiteness,  we  suppose  /  is  left  continuous  everywhere.  Our  problem  is  to 
estimate 

1.  the  number  q  of  discontinuities, 

2.  the  positions  t !,•••  , tq  of  these  discontinuities; 

3.  the  magnitudes  /(<,  +0)  -  /(£,  —  0)  of  the  jumps,  i  =  1,  •  •  •  ,  q; 
based  on  a  sample  x(  ^),  k  =  0, 1,  •  •  •  n. 

In  this  paper,  under  mild  conditions,  we  will  give  an  algorithm  to  estimate 
q ,  i.  e.  ,  define  an  estimator  q  of  q.  For  each  t  =  1,  •  •  •  q,  we  give  an  algorithm  to 
estimate  the  position  of  the  ith  discontinuity  point  t,,  i.  e.  ,  define  an  estimator 
£,■  of  ti\  we  also  define  an  estimator  Dnii  for  the  jump  at  the  ith  discontinuity 
point. 

We  will  prove  that  all  these  estimators  arc  strongly  consistent,  or  in  other 
words,  these  estimators  converge  to  the  corresponding  parameters  as  n,  the 
number  of  sample  points,  tends  to  infinity,  with  probability  one. 
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We  also  get  the  speed  of  convergence,  for  example  for  £,  we  get 


for  any  a  >  0. 

The  complexity  of  computation  of  our  algorithms  is  O(nlogn) 
approximately. 

The  basic  hypotheses  are  two: 

1.  w(t)  is  a  gaussian  white  noise. 

2.  There  exists  a  positive  constant  K  >  0,  such  that 

|/(i)  —  f{s)\  <  K\t  —  s],  if  no  £,  are  in  [s,tj. 

The  condition  1.  can  be  relaxed  to  nongaussian  white  noises,  but  it  would 
be  difficult  to  relax  2. 

In  Section  2,  we  define  the  algorithms.  In  Section  3  we  give  the  proofs. 
Works  on  this  topic  mostly  concentrate  on  the  single  jump  problem,  see  the 
references  listed  at  the  rear  of  this  paper.  Especially  no  work  has  been  done  on 
the  case  when  the  number  of  jumps  is  unknown. 

Acknowledgment.  The  author  sincerely  appreciates  the  encouragement  and 
financial  support  of  Dr.  Krishnaiah,  director  of  the  Center  for  Multivariate 
Analysis,  University  of  Pittsburgh. 

§2.  The  Method  for  Detecting  the  Change  Points.  Let  X(t)  —  r\c)  —  u'{t) 

be  a  stochastic  process,  0  <  t  <  1.  Here  w(t)  is  a  Gaussian  white  noise.  /(£)  - 
Ex(t)  has  only  finitely  many  discontinuities  *!,•••  ,£,.  All  these  discontinuities 
are  in  (0,1)  and  of  the  first  kind,  i.  e.,  /(£,  +0),  /(£*  -  0)  exist  for  i  = 
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Besides,  suppose  that  there  is  a  constant  K  >  0  such  chat  \f{s)  —f[t)  j  <  K\s  —  t 
for  any  interval  [s,f]  C  [0,  l]  not  containing  any  discontinuities.  Let 

di  =  f{U  +  0)  -  f(U  -  0)  ^  0,  i  =  1,  •  •  •  ,q, 

and  suppose 

|<fi|  >  1^2 1  >  •  •  •  >  |dq|,  and  if  |d,j  =  ldj|,  and  i  <  j,  then  £,•  <  tj. 


Let  m  =  m(n)  f  oc,  m/n  — >  0.  Define 

„  X(4J1)  +  ---  +  I(^)  xf*^1)  +  ■•••<-  if—2) 

-k'nfc  —  , 

m  m 

for  m  <  k  <  n  —  m. 

Let  hn  be  a  sequence  of  positive  numbers,  hn  — *  0,  — ~  — *  0,  for  deft- 

Tihji 

niteness,  let  hn  =  (ln3  n/ ln2  n) 1//3,  mn  =  In  n(in2  n)2,/3(ln3  n) 1/3  where  Inin  = 
1  A  Inn,  \lnkn  =  ln^ln^-i  n). 


Define 

I\  -  arg  max  ( |  Dnk  \ 


»**}’ 


h  =  arg  max  j|L>nfc|  -  ^ hn  :  |fc  -  >  4m|  , 

/3  —  arg  max  ||Dnfc| - hn  :  |fc  —  Ix\  >  4m  and  \k  -  /2j  >  4m|  . 

If  the  dehnition  is  not  unique,  we  choose  the  smallest  one.  At  first,  we  state  the 
main  theorem,  which  will  be  proved  in  Section  3. 

Theorem  3.1.  (l)  |-£  —  £,-|  <  ~  for  all  large  n,  a.  s,  i  =  1,  •  •  •  ,(/. 

(2)  DnI%  —  d{,  a.  s.,  i  —  1 ,  •  •  •  ,  q. 

(3)  Dn!i  =  0{hn),  a.  s.  for  i  >  q. 

We  see  from  this  theorem  that  the  quantities  defined  above  are  s'reng’y 

consistent  estimators  of  the  change  points  f  • ,  ■  •  *  ,  l,.  and  ti\o  magnitudes  u\  oi 

changements.  Furthermore  we  get  the  convergence  rate 

A  .  ^  2m  r  i 

- ti  <  — ,  for  large  n,  a.  s. 
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from  (1). 

Theorem  3.1  does  not  supply  a  method  to  estimate  the  integer  <7,  explicitly. 
But  based  on  Theorem  3.1,  we  can  construct  a  strongly  consistent  estimator  of 
q  in  the  following  manner. 

Let 

(4)  Gnk  =  +  +  2*~2  l-Dn./k  +  J  4 - f  kcn. 

Here  c„  >  0  with  the  properties  cn  — *  0  and  hn/cn  — *  0.  Let 

qn  =  argminGnfc. 

AC 

Suppose  we  have  proved  Theorem  3.1.  We  are  going  to  prove 


Theorem  2.1.  qn  —*  q,  a.  s. 

Proof.  If  k  <  q,  then,  almost  surely,  as  n  — *  00 


Gnk  Gnq  >  \Dnik+l !  4  t  2^ \Dniq  |  2k^n 

+  (k  —  q)cn  — *  2fe+ 1  ldk+i\  4-  •  •  •  *r  —  id<j :  >  0, 


by  (2)  of  Theorem  3.1.  That  means  qn  ==  k  for  large  n. 

If  k  >  q,  then,  noticing  \Dni,\  i,  as  t  by  (3)  of  Theorem  3.1,  we  have  for 
a  constant  C  >  0, 


Gnk  G nq  —  2'}fl  '  -yk  ^nlx-  ~  <l)cn 

>  Dnl'.x  -  {*  ~  >i)cn 

-  ~§hn  +  ^  ~  <]'>Cn  =  ((*  ~  7)  - 

>  ~  9)  >  0 
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for  n  sufficiently  large,  almost  surely.  In  this  case  qn  ^  k  either.  So,  almost 
surely,  for  n  sufficiently  large,  qn  =  q. 


§3.  Proof  of  the  Main  Theorem.  At  first  we  prove  an  elementary  lemma. 


Lemma.  Suppose  /ft)  is  defined  in  the  interval  (to  —  a,  to  a)  for  a  positive 
number  a.  Suppose  /(to— 0)  exist  and  are  finite.  Let  In  and  mn  be  two  sequences 
of  positive  integers  such  that  7n/n  — ►  to,  mnjn  — >  0,  mn  — *  oo.  Let 


=  --(/(— )  +  •••  +  /(— ) 

m\n  n 


n 


n 


where  I  —  m  —  mn.  Then,  from  liminf  >  \f[to  -r  0)  —  f[to  —  0) ' ,  we 

n  — *  oo 

can  deduce  that  An  — ►  f(t0  -f  0)  —  f[t0  -  0). 

Proof.  Without  loss  of  generality,  suppose  -  <  to  for  all  n.  Let 

I  +  k 


kn  =  max  <  k  : 


n 


<  to,  0  <  k  <  m  >  . 


!/(t)  -  /(to  -  0);  <  e,  and 


Fix  e  >  0,  36  >  0  such  that  to  -  <5  <  t  <  tQ  = 

<o  <  t  <  to  +  <5  =>  |/(t)  —  /(to  +  0)|  <  e. 

Let  JV  be  such  that  as  n  >  X,  j  ~  —  toi  <  f ,  ~~  <  § .  So,  as  n  >  -V 

<  —/(to  —  0)  +  e  +  (/(to  —  0)  +  e)  —  +  (l  — —  \  (/(to  t  0)  -  o) 

m  \  m  / 

=  ^1  -  (/(t0  4-  0)  -  /(t0  -  0))  -f  2e. 

In  the  same  way 

>  (l  "  m)  (/^°  0)  "  /(<0  “  0))  "  2t‘ 


Thus, 


\An\  < 


* m 
m 


|/(*o  +  0)  -  /(to-  0)!  4  2c. 
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Therefore  we  must  have  ^  — >  0,  thus  An  — >  f(to+0)  —  /(to  —  0). 


Theorem  3.1.  (1)  \Ik/n  —  £&[  <  for  all  large  n,  a.  s.,  for  k  =  1,  •  •  •  ,  q\ 

(2)  Dnik  -*  dk,  a.  s.,  for  k  -  1,  •  •  •  ,  q; 

(3)  Dnh  =  0(hn),  a.  s.,  for  k  >  q. 

Proof.  1.  In  this  part,  we  prove  |  ^-  —  £j|  <  ~  for  all  large  n,  a.  s. 
Introduce  the  following  notations 

a*  =  a  {/ (^)  +  ■•■  +  / (*%*)}  -  i  {/  (*=»)  +  •■•  +  / (H21)}. 

w*+  =  £M!Hr1)  +  -'-  +  ”’(!Lir!)}. 

Let  £  be  such  that  | k/n  —  <i|  =  min|A:/n.  —  i 1 1 ,  k  depends  only  on  n. 

k 

At  first,  we  note  that  for  \k/n  —  t\  \  >  2m/ n, 

W  P  (l^ntl  -  K  £  1^*1  -  J*») 

<  P  (|AS|  -  |A*|  +  (4=4)  K  <  F*+l  +  IFTl  +  Ffi  +  iW'fl)- 

Because  )£  -  £i)  >  the  points  ,“i,  ~ 1  are  all  on 

the  same  side  of  t j.  If  n  is  larger  than  some  nonrandom  number  >  0,  for 
any  k  with  x  —  ~|  >  ~ ,  the  interval  [ can  contain  at  most  one 
discontinuous  point  There  are  three  possibilities: 

(a)  No  ti  in  [i=m,i±a], 

(b)  1,  e  [4=®,4±»],  I </,-|  =  Id,!,  then  >  t„  and  k  >  k, 

(c)  ti€[^,k-^],\di\<\il\. 

For  case  (a)  above, 

|A4|>|d.|-2A'^, 


so, 

—  -i - '~hn  >  |dij  —  3 1\ - 

n  n 

for  n  >  N2  >  0,  Ar2  is  a  nonrandom  constant. 


k  -  k  1 

- hn  >  -  d 

n  2 
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For  case  (b) 


|Afc|  >  \dl\-2K-,  |Afc|  <  \dx\~2K-, 
n  n 


so,  for  a  constant  c  >  0, 


|Afc|  -  |Afc|  +  Qhn  >  -4K- 
K  n  n 

"T  (h  1 1 ) hn  t  o^hji}  ^  chT 


when  n  >  Ar3  >  0,  N2  is  nonrandom,  c  is  a  positive  constant. 

For  case  (c), 

|A^j  -  lAfci  -  -  --~-hn  >  jdjj  -  \di\  -  o(l) 

>  |(Mi!  -  Id,!)  >  0 

when  n  >  N4  >  0,  JV4  is  nonrandom. 

Thus,  for  n  >  max{ N\,  lV2,  1V3,  iV4),  and  for  some  constants  Cj  >  0,  c2  >  0, 


P\\Dn‘k\-  -hn<\Dnk\-  -hn 


<  4 P  bi  hn  < 


<  4  P 


V\  -r  •  •  •  -h  VT 


<  \s\ 


✓  o  *  —  ~  —k  m 

<8-—— - —  e  2  *-  " 

V2 Kb-fhnsfm 

<  b2c~b>  lnnln3n  =  b7n-b']n*n 

if  we  choose  hn  =  (npHr)  ,  m  =  in  n(lno  n)2/ 3 ( 1  n 3  n)  l>  3.  Here  b\,  b2,  b.> 
positive  constants,  In->  x  =  ln'nx,  ln3  x  =  lninx.  Th'as,  the  series 

P\\D  ?; - -hn  <  Dnk  -  -hn,  for  some  k  with  ^  -  id  >  - —  ) 

1 '  \  n  n  n  n 

n  x 


By  Borel-Cantelli  Lemma, 
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Since  |£  —  1 1  j  >  ]  £  —  t2|  >  1 1  and  t2  are  not  in  the  interval  {^f1,  ^r1.- 

3Ari  >  0  nonrandom,  such  that  as  n  >  Nlt  [^rri,  —f1]  contains  at  most  one 
discontinuity  point.  There  are  three  possibilities: 

(a)  Sol i  in  (ias.teBI, 

(b)  f,  e  [*=j®,  iisj.ldj]  =  'd2{,  of  course  •  >  2,  >  l5. 

(<0 

For  (a),  we  have 


9  >77  9 

|At!  >  !d2|  -  A — ,  |Afc|  <  K- — , 

K  n  n 


and  then  as  n  >  Ar2,  Ar2  nonrandom, 


,  A  r  !  —  A  >. 


k  -  k ,  ...  A  K  m  1  , 

- hn  >  d2: - /irt  A  -'^Si¬ 
nn  2 


For  (b), 


|A  j  >  |<f2j  _  K- — ,  |Afci  <  |d2;  +  A'  —  , 


and  as  n  >  Ar3,  AT3  nonrandom,  we  have 

,  ,  k  —  k ,  4m  k  —  k , 

Ar  -  A*  -  - hn  >  -I<  —  4- - hn 

*  n  n  n 


>  -K 


ti  —  <2  -f  o(l)  )  hn  >  bhn , 


for  constant  6  >  0. 

For  case  (c),  just  in  the  same  way,  we  can  show  that  as  n  >  A.i,  A.j 


nonrandom, 


|Aj4  —  j  A  fc  1  4-  — - -hn  >  -(/i2  -  d,;)  >  0. 


therefore  a s  n  >  mar(Aj,  AT2,  A's,  A'.i), 


P  ,£>  A  -  -/in  <  Dtlk\-~hn 


<  AP 


(b].hn 


t' 1  4- 


<  AP[b2hn\rrn  <  A|)  <  b2n 


—  (>3  hi~  n 
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Therefore  in  [ f ^  —  etajeirs]  there  are  no  discontinuity  points,  so 

<k2-2-  +  \w^\  +  \wt„,\, 

and 

<E/>(A»£«’V  +  l,l''r-l  +  |ll'v-l) 

<6E„p(m„<|-±^±-|) 

n  \  ,  / 

<  nP{b2\/rnhn  <  \z\)  <  oo. 

n 

Therefore 

!  =  0{hn)y  a-  5* 
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